Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            In this paper, we address the problem of a two-player linear quadratic differential game with incomplete information, a scenario commonly encountered in multi-agent control, human-robot interaction (HRI), and approximation methods to solve general-sum differential games. While solutions to such linear differential games are typically obtained through coupled Riccati equations, the complexity increases when agents have incomplete information, particularly when neither is aware of the other’s cost function. To tackle this challenge, we propose a model-based Peer-Aware Cost Estimation (PACE) framework for learning the cost parameters of the other agent. In PACE, each agent treats its peer as a learning agent rather than a stationary optimal agent, models their learning dynamics, and leverages this dynamic to infer the cost function parameters of the other agent. This approach enables agents to infer each other’s objective function in real time based solely on their previous state observations and dynamically adapt their control policies. Furthermore, we provide a theoretical guarantee for the convergence of parameter estimation and the stability of system states in PACE. Additionally, using numerical studies, we demonstrate how modeling the learning dynamics of the other agent benefits PACE, compared to approaches that approximate the other agent as having complete information, particularly in terms of stability and convergence speed.more » « lessFree, publicly-accessible full text available May 30, 2026
- 
            Free, publicly-accessible full text available June 24, 2026
- 
            Free, publicly-accessible full text available December 15, 2025
- 
            Free, publicly-accessible full text available August 1, 2026
- 
            Free, publicly-accessible full text available December 3, 2025
- 
            Free, publicly-accessible full text available July 8, 2026
- 
            The values of two-player general-sum differential games are viscosity solutions to Hamilton-Jacobi-Isaacs (HJI) equations. Value and policy approximations for such games suffer from the curse of dimensionality (CoD). Alleviating CoD through physics-informed neural networks (PINN) encounters convergence issues when value discontinuity is present due to state constraints. On top of these challenges, it is often necessary to learn generalizable values and policies across a parametric space of games, eg, for game parameter inference when information is incomplete. To address these challenges, we propose in this paper a Pontryagin-mode neural operator that outperforms existing state-of-the-art (SOTA) on safety performance across games with parametric state constraints. Our key contribution is the introduction of a costate loss defined on the discrepancy between forward and backward costate rollouts, which are computationally cheap. We show that the discontinuity of costate dynamics (in the presence of state constraints) effectively enables the learning of discontinuous values, without requiring manually supervised data as suggested by the current SOTA. More importantly, we show that the close relationship between costates and policies makes the former critical in learning feedback control policies with generalizable safety performance.more » « less
- 
            Free, publicly-accessible full text available May 19, 2026
- 
            Unmanned aerial manipulators have been growing in popularity over the years, alongside the complexity of the tasks they undertake. Many of these tasks include physical interaction with the environment, where a force control or sensing component is desirable. In these types of applications, the forces and torques, or the wrench, acting on the robot by the environment must be known. This paper presents a wrench observer based on an Extended Kalman filter (EKF), and compares it against acceleration-based, momentum-based, and hybrid wrench observers. Simulations using each of these observers are conducted with an underactuated aerial manipulator composed of a hexarotor with coplanar propellers and a 2-DOF manipulator. Measurement noise on par with what is expected in real-world applications is added to the sensor signals, and results show that the EKF-based wrench observer is superior at noise reduction and wrench estimation in many cases compared to the other observers.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                     Full Text Available
                                                Full Text Available